Run one deployment orchestrator per deployment per cluster #1865

joshk · 2025-01-31T05:28:13Z

Currently we run one Deployments.Orchestrator per deployment per node, it doesn't matter if the deployment is active or not, it will still be run on each node.

If we have 100 deployments, and 10 nodes, then we have 100 orchestrators per node, and 1000 overalls.

This architecture, specifically having an orchestrator for a deployment run on every node, makes it very hard to ensure only a certain number of concurrent updates are running at the same time. And it also makes it hard to build out deployment workflows.

This PR uses ProcessHub to ensure one orchestrator is running per deployment across the cluster. Additionally, it only runs orchestrators that are set as active.

This ensures centralised management of a deployment, while using ProcessHub to redistribute orchestrators to other nodes if a node is to go offline.

The current implementation allows for us to switch between strategies.

If we commit to this path (after testing in the field), we can remove the device registry, simplifying device connections. Plus, it will also enable us to add support for other device transports and transport setups, like a socket proxy or mqtt.

To run this locally, open up two terminals and run:

DEPLOYMENTS_ORCHESTRATOR=clustered make iex-server-clustered

and

DEPLOYMENTS_ORCHESTRATOR=clustered make iex-server-clustered num=2

Orchestrators are set to only run on web nodes, which also builds up to us supporting a 'device proxy' setup.

Still required from this PR:

tests
testing in an isolated environment
specs and docs

lawik · 2025-01-31T07:44:38Z

Very cool.

Running orchestrators to do nothing does seem like a waste, though I guess orchestrators do have ad-hoc work as new devices join the deployment.

I guess throughput for highly concurrent rollouts (1000-2000 at a time) of hundreds of thousands of total devices is a bit of a question. How fast is it to have a single node doing that work?

The way orchestrators work now is unusual and we definitely want all this under test regardless of strategy. I think having multiple nodes share the work of pushing out updates has some upside in terms of efficiency and reliably getting updates out but it does have drawbacks for concurrency control and monitoring the process. I guess focusing it back into one node makes it a lot easier to check the results of the fleet health as the deployment rolls out as well.

I am overall in favor. I haven't worked with Horde before so don't instinctively trust it but have heard good things :D

lib/nerves_hub/devices.ex

lib/nerves_hub/deployments/distributed/orchestrator_registration.ex

lib/nerves_hub/deployments.ex

lib/nerves_hub/devices.ex

Device connections might be flakey, but the update might still be happening in the background.

If the inflight update firmware uuid matches the device metadata then the update was likely a success.

Otherwise the delay in starting can cause it to miss some broadcasts, and make testing a bit harder.

This adds a new event that is sent to the orchestrator when the device is fully 'online' and has gone through the `after_boot` and device registration steps.

If the devices firmware matches the deployments, there is no need to trigger an orchestrator run

Since we now tell the deployment that a device assigned to it is online, we can place the "device finished updating" broadcast in the right place

This reduces subscribers to `deployment:#{id}` receiving a bunch of messages which mean nothing to them

Add a 10sec buffer between orchestrator runs. This is done by using a `send_after` timer ref, tracking if another call has been made, and allowing for the buffer to be skipped during testing.

…es-hub/nerves_hub_web into distributed-deployment-orchestrator

A device now goes from `:connecting` to `:connected`, signifying that it is ready to receive updates. Using this different status allows us to tell the orchestrator to only schedule update for devices that have "finished" connecting

…vice

This provides a nice cleanup in our new orchestrator and tests. It's just a normal GenServer and doesn't know anything of `ProcessHub`, which is just a way of running it at scale.

joshk force-pushed the distributed-deployment-orchestrator branch from 4f2dfff to 4db1922 Compare February 1, 2025 08:33

joshk marked this pull request as ready for review February 3, 2025 05:02

joshk force-pushed the distributed-deployment-orchestrator branch from eb9d52a to 5730bfb Compare February 3, 2025 08:32

joshk requested review from jjcarstens, nshoes and lawik February 3, 2025 09:08

nshoes reviewed Feb 3, 2025

View reviewed changes

joshk added 22 commits February 4, 2025 08:11

Run one deployment orchestrator per deployment per cluster

c4ea461

Support devices telling the orchestrator that they might of updated

d626d08

Minor cleanup

4949b88

Don't forcefully clear inflight updates

dff7764

Device connections might be flakey, but the update might still be happening in the background.

Remove inflight updates where the firmware matches the device

0f44a8e

If the inflight update firmware uuid matches the device metadata then the update was likely a success.

Only message an orchestrator if the device has a deployment

975105f

Some cleanup, some docs, some encapsulation

19953da

Use singular where the event is for one deployment

104fd3b

Prefer Phoenix.Socket.Broadcast

1047693

Use init over handle_continue(:boot, _)

795675e

Otherwise the delay in starting can cause it to miss some broadcasts, and make testing a bit harder.

Tests and fixes from the tests

b5476c5

Test scheduling updating as devices come online

f0a5441

This adds a new event that is sent to the orchestrator when the device is fully 'online' and has gone through the `after_boot` and device registration steps.

Remove a commented out test which has been covered

5936c1f

Bump the Orchestrator timer interval to 90 secs

466a127

Fix the need to have a second start_link function on the orchestrator

c38c188

Remove an unneeded subscribe in a test

37a36ef

Allow the orchestrator to ignore the device-online

07b69e8

If the devices firmware matches the deployments, there is no need to trigger an orchestrator run

Only send the "device finished updating" broadcast if it did update

fa3edd0

Since we now tell the deployment that a device assigned to it is online, we can place the "device finished updating" broadcast in the right place

The original Monitor shouldn't have been changed

10579fd

Make it easier to start a cluster in dev

82da381

Only start the clustered orchestrator on the web nodes

cce4ce8

Switch from Horde to ProcessHub

ee871d2

joshk added 7 commits February 6, 2025 20:29

Change the topic the Orchestrator uses

0f74e93

This reduces subscribers to `deployment:#{id}` receiving a bunch of messages which mean nothing to them

Better management of the orchestrator mailbox

06243c1

Add a 10sec buffer between orchestrator runs. This is done by using a `send_after` timer ref, tracking if another call has been made, and allowing for the buffer to be skipped during testing.

Merge branch 'distributed-deployment-orchestrator' of github.com:nerv…

7d01c7b

…es-hub/nerves_hub_web into distributed-deployment-orchestrator

Reduce the buffer to 5 seconds

9876d56

[publish]

935235d

Fix a broadcast payload

a071eb8

[publish]

7c6d00f

joshk mentioned this pull request Feb 6, 2025

Question regarding alpha status alfetahe/process-hub#9

Closed

joshk added 12 commits February 7, 2025 09:37

Merge branch 'main' into distributed-deployment-orchestrator

a625c4e

Address dialyzer and credo warnings

b860f0a

Add a connection status of :connecting

e394ad8

A device now goes from `:connecting` to `:connected`, signifying that it is ready to receive updates. Using this different status allows us to tell the orchestrator to only schedule update for devices that have "finished" connecting

mix format is our savior

fb2ef27

Fix a tests assert

f79114d

Fix an issue where two firmware updates could be sent for the same de…

9780534

…vice

[publish]

2d3ef76

Merge branch 'main' into distributed-deployment-orchestrator

27f3c28

[publish]

07796b3

Notify the Orchestrator when a device is enabled for updates

4db5b5a

Don't trigger the Orchestrator if updates are blocked for the device

5e85294

Credo didn't like an if using brackets

70eacb5

joshk force-pushed the distributed-deployment-orchestrator branch from e3d41a1 to 70eacb5 Compare February 7, 2025 10:11

joshk and others added 9 commits February 7, 2025 23:14

[publish]

769860b

The deployment isn't loaded, so use device.deployment_id instead

99fe8ea

[publish]

65929ab

Merge branch 'main' into distributed-deployment-orchestrator

610232a

Dialyzer good times

4dc727e

New ProcessHub version allows for clean GenServer shutdown

2d5a8d6

This provides a nice cleanup in our new orchestrator and tests. It's just a normal GenServer and doesn't know anything of `ProcessHub`, which is just a way of running it at scale.

[publish]

fd27860

Merge branch 'main' into distributed-deployment-orchestrator

4adb989

[publish]

188d180

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Run one deployment orchestrator per deployment per cluster #1865

Run one deployment orchestrator per deployment per cluster #1865

joshk commented Jan 31, 2025 •

edited

Loading

lawik commented Jan 31, 2025

Run one deployment orchestrator per deployment per cluster #1865

Are you sure you want to change the base?

Run one deployment orchestrator per deployment per cluster #1865

Conversation

joshk commented Jan 31, 2025 • edited Loading

lawik commented Jan 31, 2025

joshk commented Jan 31, 2025 •

edited

Loading